AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Document image understanding

# Document image understanding

Paligemma Rich Captions
Apache-2.0
An image caption generation model fine-tuned on the DocCI dataset based on PaliGemma-3b, capable of generating detailed descriptions of 200-350 characters with reduced hallucination
Image-to-Text Transformers English
P
gokaygokay
66
9
Donut Base Finetuned Latvian Receipts V2
MIT
A model based on the Donut architecture, specifically fine-tuned for Latvian receipt data
Text Recognition Transformers
D
Inesence
13
0
Donut Base Finetuned Latvian Receipts
MIT
This model is a fine-tuned version of donut-base on a Latvian receipt dataset, primarily used for receipt image processing tasks
Text Recognition Transformers
D
Inesence
31
0
Donut Base Payslips
MIT
Document understanding model based on Donut architecture, specifically fine-tuned for payslip image processing
Text Recognition Transformers
D
Assadullah
20
0
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase